Learning Wake-Sleep Recurrent Attention Models
Authors
Abstract
Despite their success, convolutional neural networks are computationally expensive because they must examine all image locations. Stochastic attention-based models have been shown to improve computational efficiency at test time, but they remain difficult to train because of intractable posterior inference and high variance in the stochastic gradient estimates. Borrowing techniques from the literature on training deep generative models, we present the Wake-Sleep Recurrent Attention Model, a method for training stochastic attention networks which improves posterior inference and which reduces the variability in the stochastic gradients. We show that our method can greatly speed up the training time for stochastic attention networks in the domains of image classification and caption generation.
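The "high variance in the stochastic gradient estimates" mentioned above can be illustrated with a toy score-function (REINFORCE) estimator. The sketch below is not the paper's method: it assumes a single Bernoulli decision `z` with probability `sigmoid(theta)`, a hypothetical reward `toy_reward`, and a constant baseline, and it only shows how subtracting a baseline can leave the expected gradient unchanged while shrinking its variance.

```python
import math
import random
import statistics


def score_function_gradients(theta, reward, baseline, n_samples, rng):
    """Per-sample score-function (REINFORCE) gradient estimates w.r.t. theta.

    z ~ Bernoulli(p) with p = sigmoid(theta), so d/dtheta log p(z) = z - p.
    """
    p = 1.0 / (1.0 + math.exp(-theta))
    grads = []
    for _ in range(n_samples):
        z = 1 if rng.random() < p else 0
        # Subtracting a constant baseline keeps the estimator unbiased.
        grads.append((reward(z) - baseline) * (z - p))
    return grads


def toy_reward(z):
    # Hypothetical reward with a large constant offset, which inflates variance.
    return 10.0 + z


rng = random.Random(0)
theta = 0.0  # p = 0.5, so the exact gradient of E[reward] is p * (1 - p) = 0.25

plain = score_function_gradients(theta, toy_reward, 0.0, 20000, rng)
# Baseline set to E[reward] = 10.5 for this toy problem (an assumption of the sketch).
centered = score_function_gradients(theta, toy_reward, 10.5, 20000, rng)

print("without baseline: mean, variance =",
      statistics.fmean(plain), statistics.pvariance(plain))
print("with baseline:    mean, variance =",
      statistics.fmean(centered), statistics.pvariance(centered))
```

In this toy setting both estimators target the same gradient, but the baseline-corrected samples are far less spread out. Real attention models have many sequential decisions and learned baselines, so the effect is less clean there.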
Similar resources
Prevalence of Sleep Disorders and Neuropsychological Learning Disabilities in Preschool Children
Introduction: The reported prevalence of sleep disorders differs across international studies. Sleep disorders are common among children, and their prevalence is increasing. Cognitive problems are the most serious complication of sleep disorders in children. In the present study, the prevalence of sleep problems and neuropsychological learning disabilities was evaluated in pre-school children (4-6 years old) i...
A wake-sleep algorithm for recurrent, spiking neural networks
We investigate a recently proposed model for cortical computation which performs relational inference. It consists of several interconnected, structurally equivalent populations of leaky integrate-and-fire (LIF) neurons, which are trained in a self-organized fashion with spike-timing dependent plasticity (STDP). Despite its robust learning dynamics, the model is susceptible to a problem typical ...
Comorbidity of Non-24-hour Sleep-wake Syndrome and Seasonal Affective Disorder in a Young Man: a Case Report
Objective: Few clinical reports have described in detail the comorbidity of seasonal affective disorder (SAD) and non-24-hour sleep-wake syndrome (non-24-SW). Both SAD and non-24-SW are thought to be caused by the interplay between internal clock dysfunction and insufficient external time cues. The aim of this study is to present and discuss in detail a subtype of psychiatric comorbidity and it...
Factor Analysis Using Delta-Rule Wake-Sleep Learning
We describe a linear network that models correlations between real-valued visible variables using one or more real-valued hidden variables: a factor analysis model. This model can be seen as a linear version of the Helmholtz machine, and its parameters can be learned using the wake-sleep method, in which learning of the primary generative model is assisted by a recognition model, whose role is t...
Convergence of the Wake-Sleep Algorithm
The W-S (Wake-Sleep) algorithm is a simple learning rule for models with hidden variables. It has been shown that this algorithm can be applied to a factor analysis model, which is a linear version of the Helmholtz machine. But even for a factor analysis model, general convergence has not been proved theoretically. In this article, we describe the geometrical understanding of the W-S algorithm in co...